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Abstract: The mechanisms responsible for the determination of phenotypes are still not well 
understood; however, it has become apparent that modifier genes must play a considerable 
role in the phenotypic heterogeneity of Mendelian disorders. Significant advances in genetic 
technologies and molecular medicine allow huge amounts of information to be generated from 
individual samples within a reasonable time frame. This review focuses on the role of modi- 
fier genes using the example of cystic fibrosis, the most common lethal autosomal recessive 
disorder in the white population, and discusses the advantages and limitations of candidate gene 
approaches versus genome- wide association studies. Moreover, the implications of modifier gene 
research for other monogenic disorders, as well as its significance for diagnostic, prognostic, and 
therapeutic approaches are summarized. Increasing insight into modifying mechanisms opens up 
new perspectives, dispelling the idea of genetic disorders being caused by one single gene. 
Keywords: modifier genes, cystic fibrosis, genome wide association studies, candidate gene 
approach, genotype, phenotype 

Introduction 

The mechanisms responsible for the determination and fine-tuning of phenotypes are 
still not well understood, albeit it is widely accepted that the specific disease genotype, 
epigenetic factors, and environmental influences are significantly involved. However, it 
has become increasingly apparent that modifier genes must play a considerable role in the 
phenotypic heterogeneity of Mendelian disorders, reflected for instance by intrafamilial 
variations, incomplete penetrance, and varying severity of the clinical course. New genetic 
technologies and advances in molecular medicine allow huge amounts of information to 
be generated from individual samples in the form of genomic, transcriptomic, and pro- 
teomic data. However, analysis and interpretation of this flood of findings, most notably 
putting them into the right biological context, represents one of the main challenges for 
research and medicine today. The purpose of this review is to provide insight into the role 
of modifier genes as an important source of phenotypic variability in so-called monogenic 
disorders, using the example of cystic fibrosis (CF). Further, it discusses the benefits and 
limitations of the actual knowledge as well as future technical and strategic approaches. 
However, since a comprehensive discussion of all modifier gene studies is beyond the 
scope of this review, the main focus will rather be on recent progress made in the field. 

Modifier genes 

Determination of the genotype of a single locus is usually not sufficient to predict a 
clinical phenotype in human disorders considered to be inherited in simple Mendelian 
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patterns, assuming no clear distinction between monogenic 
and complex traits or involvement of other molecular 
mechanisms. Thus, the traditional view of genetic disor- 
ders being either monogenic or multifactorial is no longer 
supported and requires a change in focus towards factors 
that modify gene expression and disease course (Figure 1). 
Functioning in a common cellular environment, genes are 
expected to interact with and influence each other. The inter- 
action between genes at different loci in a such way that they 
combine to change the disease phenotype is called epistasis, 1 
and the genes involved are genetic modifiers. Nowadays, it 
is well recognized that epistasis is a ubiquitous component 
of human disease and that complex interactions play a more 
significant role than the independent main effects of any 
single susceptibility gene. 2 Modifier genes can influence 
the phenotypic outcome of a given genotype via the same, 



a related, or a parallel biological pathway as the disease 
gene. 3 - 4 Moreover, for any given disorder, multiple modi- 
fier genes may contribute to the phenotype in a cumulative 
way and have the potential to alter penetrance, expressivity, 
pleiotropy, and severity, including age of disease onset and 
progression rate. Modifier genes can enhance the primary 
defect (synergistic epistasis), leading to a more severe phe- 
notype, or reduce it (antagonistic epistasis) and thus have a 
protective effect. However, the search for modifier genes 
is very challenging and disease associations are difficult to 
establish, because the contribution of a single modifier to the 
phenotype may be small and can be masked by other influ- 
encing factors, such as environment and ethnicity, as well 
as individual genetic background. Thus, no single genetic 
approach meets all conditions required for comprehensive 
and successful identification of modifier genes in monogenic 




Figure I Scheme of the relationship between genotype and phenotype, and the influence of additional factors such as modifier genes, entire genetic background, epigenetic, 
and environmental effects on cystic fibrosis. 

Notes: In primarily monogenic disorders, the phenotype is significantly defined by the disease-causing mutations while direct and indirect modifier genes followed by the 
entire genetic background, epigenetics, and environment essentially induce phenotypic variability. Multifactorial and complex diseases, however, are mostly the result of one 
or several genetic predispositions triggered by exogenous factors. Yellow circles represent modifiers enhancing CFTR disease severity, and blue circles represent modifiers 
with a protective effect against CFTR. The beige circles represent the genetic background, and the red shapes show the methylation of specific genes (epigenetic influences). 
Direct modifiers are shown without an arrowhead, and indirect modifiers with an arrowhead. A range of environment effects (eg, chemicals, climate, drugs, infections, 
radiation) are shown in the outer circle. 

Abbreviation: CFTR, cystic fibrosis transmembrane conductance regulator. 
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diseases, and the combination of classical approaches with 
genome-wide screenings may offer the best strategy. 

Identification of modifier genes 

Different mechanisms might explain the variability of mono- 
genic disease expression such as genetic heterogeneity, addi- 
tional variant(s) in the disease gene in cis or trans position to 
the primary mutation(s), modifier genes, epigenetic changes 
(DNA methylation, histone modification), and environmental 
conditioning. Searching for modifier genes requires a different 
approach than looking for genes primarily involved in develop- 
ment of a disease. The first and most important step is to define 
the clinical phenotype of a disease that represents the variability 
between affected individuals and is expected to be modified by 
factors other than the primary defect. The second challenge 
is then to choose the corresponding patient cohort. Strategies 
to identify modifier genes depend on the data available, and 
include linkage analyses as well as association studies. 

Approaches are either hypothesis-driven, focusing on 
specific candidate genes and candidate pathways, or represent 
a blind search systematically scanning the whole genome. 
The use of single nucleotide polymorphism (SNP) arrays 
including more than one million markers randomly chosen to 
cover the genome recently allowed genome-wide association 
(GWA) studies to be performed in complex and monogenic 
diseases; however, to obtain the statistical evidence needed to 
prove an association between a sequence variant and a disease 
phenotype, very large samples of patients and control indi- 
viduals are required a condition that is very hard to fulfill. 5 
Therefore, for many studies, choosing the candidate gene 
approach will be more feasible and effective. Candidate genes 
may be part of the interactome of the primary disease gene 
or, alternatively, belong to another pathway that is indirectly 
involved in the disease course (eg, the inflammatory pathway 
in CF). Thus, resequencing of the selected genes in a carefully 
characterized patient cohort with well defined phenotypes 
remains a promising strategy. In summary, the challenge 
for studying monogenic disorders is to assess the relative 
contribution of modifier genes to the disease phenotype and 
to reliably identify these genes, bearing in mind that multiple 
modifier genes may create a cumulative effect on the phe- 
notypic expression and that the combination of these genes 
may vary between individuals. 

As one of the most common Mendelian disorders and 
presenting with a broad clinical heterogeneity, CF is one 
of the best examples to demonstrate strategic approaches 
for the identification of modifier genes and to explain their 
involvement in phenotypic expression. 



Cystic fibrosis 

CF is the most common lethal autosomal recessive disorder 
in the white population, with an incidence of one in around 
2,500 and a carrier frequency of one in 25. Thus, in every 
625th couple, both partners are carriers, facing a one in four 
risk of having a child with CF. The main clinical symptoms 
of CF are elevated sweat chloride concentrations, exocrine 
pancreatic insufficiency (in 85%-90% of patients), male 
infertility, and progressive obstructive lung disease. 67 
Nowadays, lung disease is the major cause of morbidity and 
mortality in CF, arising from chronic bacterial infections 
with a persistent neutrophilic inflammatory response and 
leading to airway damage, bronchiectasis, emphysema, and 
finally to respiratory failure. 8 Additional complications of 
CF include meconium ileus in the neonatal period affecting 
about 1 5% of newborns with CF, 9 severe hepatic cirrhosis and 
portal hypertension in 3%-5% of patients, and CF-related 
diabetes, that develops in only 2% of children but in 19% of 
adolescents and over 40% of adults with CF. 10 Pancreatitis 
is also a well known albeit less common manifestation, that 
occurs in 15%— 20% of pancreatic-sufficient CF patients." 
Moreover, nearly all patients present with sinus disease 
(chronic sinusitis and/or nasal polyposis), but not all are 
symptomatic. 12 During recent decades, the life expectancy for 
patients with CF has improved significantly, from less than 
one year to more than 35 years, due to specific treatments, 
including pancreatic enzyme and vitamin supplementation, 
antibiotics, and inhaled bronchodilators and steroids, as 
well as autogenic drainage and aerobic exercise, indicating 
a considerable effect of nongenetic environmental factors 
on disease variability. 

In addition to the patients with the classical life- 
shortening form of CF, there are individuals with cystic 
fibrosis transmembrane conductance regulator (CFTR)- 
related metabolic syndrome or with monosymptomatic 
disorders, such as congenital bilateral absence of the vas 
deferens, acute recurrent or chronic pancreatitis, and dis- 
seminated bronchiectasis. This clinical entity of disorders 
that do not meet the diagnostic criteria of CF but are associ- 
ated with CFTR dysfunction is classified as CFTR-related 
disorders. 13 

CFTR gene and protein 

The gene causing CF, identified in 1989, 1415 is located 
on chromosome 7q31.2, spanning a transcription unit 
of -216.7 kb and containing 27 exons. 16 It is transcribed 
into 6. 1 kb messenger ribonucleic acids (mRNAs) encoding 
the transmembrane CFTR protein of 1 ,480 amino acids. The 
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main function of CFTR is that of a cyclic AMP-regulated 
chloride channel located at the apical membrane of polar- 
ized epithelial cells that is expressed in several functionally 
diverse tissues, including the lungs, sweat ducts, pancreas, 
gastrointestinal tract, and vas deferens. 1718 Moreover, 
CFTR directly mediates secretion of bicarbonate across the 
apical membrane linking ion transport and luminal pH, with 
mucin secretion, plugging, and retention, which are the 
hallmarks of CF pathology. 19 21 CFTR-mediated chloride 
secretion across epithelial cells is controlled by both 
modulating channel activity and regulating the total num- 
ber of CFTR channels in the membrane. 22 Alterations in 
the synthesis or sequence of the CFTR protein may affect 
the number of channels in the plasma membrane, channel 
activity, and/or intracellular trafficking of CFTR. 

Genotypic and phenotypic 
heterogeneity 

Nearly 2,000 different sequence variants have been reported 
so far to The Cystic Fibrosis Genetic Analysis Consortium 
(CFTR1 database, http://www.genet.sickkids.on.ca/cftr/ 
app ). The most common defect causing CF is the F508del 
mutation (c.l521_1523delCTT; p.Phe508del according 
to the current standard nomenclature), a 3 bp deletion in 
exon 10 causing loss of the amino acid phenylalanine at 
position 508 of the protein. 23 Worldwide, this allele accounts 
for -60% of all CF chromosomes (range 27%-87.5%), with 
considerably variable frequencies depending on populations 
and geographic locations. 24 There are another 23 relatively 
common mutations (frequency >0.5%) worldwide and a 
few mutations with an unusually high frequency in specific 
populations, indicating a founder effect and genetic drift. The 
remaining mutations represent rare or even individual alleles 
that are distributed throughout the entire gene. Mutations 
(missense, nonsense, frameshift, splice, small and large in- 
frame deletions and insertions) contribute to the phenotype 
by their nature and position in the gene and can be classified 
by their molecular effects on CFTR. 25 - 26 Class 1 mutations are 
defined as loss of function, and thus are expected to result 
in no functional CFTR protein and to be associated with a 
severe phenotype. Class 2 mutations produce misprocessed/ 
misfolded proteins (eg, F508del), class 3 mutations affect 
activation or gating of the CFTR channel (eg, G55 ID), class 4 
mutations reduce ion conductance through the channel pore 
(eg, Rl 17H, R334W, R347P), and class 5 mutations reduce 
the amount of functional CFTR either by partially aberrant 
splicing (eg, 3849 + lOkbOT, 5T) or by inefficient traffick- 
ing (eg, A455E). Class 6 includes nonsense and frameshift 
mutations (eg, Q1412X, 4326delTC, 4279insA) causing 



a 70-100 bp truncation of the C-terminus of the CFTR that 
leads to a marked instability of an otherwise fully processed 
and functional protein and as a consequence to a severe CF 
presentation. 27 Mutations in any of the classes 2-5 display 
a broad range of phenotypic effects, making prediction of 
the clinical course of a patient impossible simply based on 
mutation classification. Moreover, the potential of a mutation 
to contribute to the phenotype depends not only on its nature, 
localization in the gene, and molecular mechanism, but also 
on its interaction with the second mutated CFTR allele and 
on disease modifiers. Although CF is considered a monogenic 
disorder, studies of clinical phenotype in correlation with 
genotype have revealed a very complex relationship. Some 
phenotypic features are closely determined by the genotype in 
an essentially monogenic fashion, whereas others are strongly 
influenced by both modifying genetic factors and the envi- 
ronment. There is a close relationship between the CFTR 
genotype and the pancreatic phenotype, revealing "severe" 
mutations (eg, F508del, all class 1 mutations) to be associ- 
ated with pancreatic insufficiency and "mild" mutations, such 
as a series of missense and alternative splice mutations, to 
be associated with pancreatic sufficiency. 28 - 29 Interestingly, 
pancreatic-sufficient patients carrying two mild CFTR muta- 
tions are at significantly higher risk of developing pancreatitis 
than patients with moderate or moderate/severe genotypes 
correlating with the residual CFTR function and the pancre- 
atic acinar reserve. 11 On the other hand, because of its com- 
plexity and patient exposure to a multitude of endogenous and 
exogenous factors, the pulmonary outcome is clinically the 
most variable as well as the most unpredictable component 
of the CF phenotype. Studies focusing on pulmonary status 
as a function of the F508del allele have reported a wide range 
of effects, from a detectable impact of CFTR genotype to no 
or statistically insignificant influences. 30 - 31 Other studies using 
more refined assessment of CFTR mutations have shown 
statistically significant correlations between CFTR genotypes 
and pulmonary status, 32 whereas still others have failed to 
detect a significant association. 33 From my own studies, we 
can conclude that there is a significant association between 
specific genotypes and severity of pulmonary disease, and that 
F508del homozygous patients show the most considerable 
variation in severity of pulmonary disease. 34-36 The impact of 
CFTR genotypes has also been investigated in other organs, 
such as the sweat glands and the reproductive tract in men. 37,38 
One major conclusion resulting from these studies was that 
expression of CFTR in different organ systems of the same 
individual is highly variable, and that therefore each organ 
may be variably affected, making diagnostic and therapeutic 
approaches much more complicated. 
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Despite all efforts, the clinical and functional relevance 
of most CFTR mutations has not been described so far. 
However, to address this gap, a genotype-phenotype data- 
base ( http://www.cftr2.org ) has been established including 
nearly 39,700 CF patients, that comprises around 190 
CFTR mutations and gives evidence whether a mutation 
can be classified as causing CF disease, not causing CF, or 
if varying clinical consequences can be expected. This is an 
ongoing project and an important addition to the traditional 
CFTR1 database. 

Taken together, all these findings point to the understanding 
that a CFTR genotype constitutes only the source or potential 
for CF disease, which will to varying degrees be expressed 
and translated into CF pathophysiology. Thus, classic 
genotype-phenotype studies are important but not sufficient 
and have to be complemented by the search for environmental 
and non-CFTR genetic effects on clinical phenotype in order 
to enhance the knowledge of disease mechanisms and to 
develop new diagnostic and therapeutic approaches. 

Role of genetic modifiers in CF 

Previous findings indicating a strong influence of genetic 
modulators on CF phenotype 39 ^ 1 and identification of 
macromolecular complexes regulating the intracellular 
trafficking and activity of CFTR 42 - 43 emphasize the need to 
focus on the detection of the genes and biological mecha- 
nisms involved and provide the well founded basis for 
extended GWA and candidate gene studies. Two different 
approaches have been used to identify modifiers in CF, ie, 
case-control association studies on the one hand and family- 
based linkage studies on the other. As both methods have 
their advantages and disadvantages, eg, higher robustness 
regarding detection of common genetic variants (case- 
control association studies) or better differentiation between 
environmental versus genetic effects (family-based studies), 
complementary studies employing both methodologies are 
highly recommended. 44 Moreover, any association between 
a chromosomal region, a specific gene locus, or a single gene 
variant should if ever possible, be replicated in one or more 
separate patient and control cohorts, 10 although differences 
in ethnic background treatment procedures, environmental 
factors, and socioeconomic influences may weaken or even 
hide any significant findings. 

Genome-wide approaches 

CF lung disease 

The North American Cystic Fibrosis Gene Modifier 
Consortium performed a combined GWA and linkage study 
in a total of 3,467 CF patients and identified significant and 



replicated association with loci within the chromosomal 
regions llpl3 and 20ql3.2 modifying lung disease sever- 
ity and harboring genes of biological relevance for CF. 45 
The strongest signal was located at 1 lp 13, in the intergenic 
region 3' to the genes APIP (APAF1 -interacting protein) 
and EHF (ETS homologous factor) with regulatory features 
and correlation of expression of nearby genes such as EHF 
to ELF 5, EHF to APIP and APIP to PDHX. The EHF gene 
encodes a protein that belongs to an ETS transcription factor 
subfamily characterized by epithelial-specific expression 
and acts as an important regulator of epithelial differen- 
tiation under stress conditions and inflammation. Very 
recently, EHF has been confirmed as a modifier of the CF 
phenotype in the patient cohort of the European CF Twin 
and Sibling Study by altering capabilities of the epithelial 
cells to correctly process folding and trafficking of mutant 
CFTR. 46 The second gene within the 1 lp 1 3 region, APIP, 
inhibits cytochrome c-dependent and APFA 1 -mediated cell 
death and prevents ischemic/hypoxic injury. It is suggested 
that increased expression of APIP resulting in exaggerated 
inhibition of apoptosis may worsen CF lung disease, sup- 
porting the concept that inhibition of apoptosis in the airways 
leads to delayed clearance of neutrophils and consequently 
to a hyperinflammatory state, and contributes to goblet cell 
metaplasia, representing a feature of CF airway disease. 47,48 
SNPs in the intergenic region of 1 lpl3 may impair expres- 
sion of EHF and APIP, but also that of other proximate 
genes such as ELF5 and PDHX, so studies are ongoing to 
elucidate the mechanism of disease pathogenesis resulting 
from genomic polymorphisms. 44 

Five genes {CBLN4, MC3R, CASS4, CSTF1, AURKA) 
were localized within the 20pl3.2 linkage region, all of 
them being expressed in either fetal or adult lung or bron- 
chial epithelial cells. CASS4, for instance, is a member of 
the CAS scaffolding protein family and plays a role in the 
tyrosine kinase-based signaling regulating cell adhesion and 
spreading. 49 AURKA encodes an Aurora-related kinase A that 
is involved in various mitotic events, such as establishment 
of the mitotic spindle, centrosome maturation, chromosomal 
alignment, and cytokinesis, as well as in proper disassembly 
of primary cilia. 50 The most interesting among the five genes, 
however, is MC3R, which has been implicated in regulation 
of energy balance and modulation of neutrophil accumula- 
tion in lung inflammation, both of which are key features 
of CF lung disease. 51 - 52 Studies are underway to identify the 
mechanisms of lung disease that may reflect the linkage 
signal at 20pl3.2. 44 

In addition, there were six other regions of suggestive 
association, among them the HLA class II DR locus 
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on 6p21.3, which is well known to be involved in the 
manifestation of proinflammatory and autoimmune disorders, 
such as rheumatoid arthritis, type 1 diabetes, and multiple 
sclerosis, and the highly conserved ancestral haplotype 8.1 
encoded in the HLA region has recently been reported to be 
associated with the severity of CF lung disease. 53 

In an exome-wide approach, Emond et al 54 sequenced 
43 CF patients with early onset of chronic Pseudomonas 
aeruginosa infection (<5 years) as well as 48 individuals 
who had never had P. aeruginosa infection by age 
14 years or beyond. Missense variants in a single gene, 
DCTN4 (dynactin 4), on chromosome 5q33.1 were found 
to be significantly associated with early age of chronic 
P. aeruginosa infection as well as of the first mucoid 
P. aeruginosa culture discovering DCTN4, a component 
of the dynein-dependent motor and part of the autophagy 
process, as an important genetic modifier of the CF 
phenotype. Although this strategy usually requires analysis 
of larger sample sizes, exome sequencing is an additional 
promising tool for identification of disease modifiers in cases 
of extreme phenotype study design (Tables 1 and 2). 

CF in intestine cells 

Using genome-wide analyses, Blackman et al 55 identified 
several chromosomal regions suggesting contribution to 
(4q35.1, 8p23.1, llq25) or protection from (20pll.22, 
21q22.3) development of meconium ileus. By employing 
both linkage and transmission methods in a family-based 
study, followed by replication in an unrelated sample of CF 
patients, Henderson et al 56 confirmed a strong association 
between a three-polymorphism haplotype corresponding 
to the MSRA gene on chromosome 8 and meconium ileus. 



Another linkage study replicated in an independent sample 
of CF families provided evidence for a locus promoting 
meconium ileus on chromosome 12pl3.3 and pointed to the 
involvement of the adiponectin receptor 2 gene (ADIPOR2). 
Moreover, a protective locus was mapped to chromosome 
4ql3.3, including the sodium bicarbonate cotransporter 
SLC4A4 as a candidate gene. 57 A conventional GWA study 
identified the association of several SNPs within two regions, 
including SLC2 6A9 on chromosome 1 (lq32.1) mdSLC6A14 
on chromosome X (Xq23-24), with meconium ileus in 3,763 
affected individuals and these associations were replicated in 
an independent combined patient cohort from North America 
and France. A subsequent hypothesis-driven GWA study 
confirmed SLC26A9 and SLC6AJ4 and in addition SLC9A3 
as modifiers of meconium ileus, but also provided evidence 
of a collective association of multiple constituents of the 
apical membrane with meconium ileus. 58 

In another approach, body mass index data collected 
from the CF Twin-Sibling Study (ClinicalTrials.Gov 
identifier NCT00037778) 41 were analyzed to define the 
contribution of genetic modifiers to the nutritional status of 
young CF patients aged 5-10 years. 59 For 1,010 individuals 
with pancreatic insufficiency, significant linkage of body 
mass index and the two chromosomal regions lp36.1 and 
5ql4 was found, the latter including the arrestin domain- 
containing 3 (ARRDC3) gene, a regulator of body mass and 
energy expenditure in males. 60 

CF-related diabetes 

A GWA study including 3,059 CF patients identified an 
association of SNPs within and 5' to the SLC26A9 gene 
with CF-related diabetes. These findings were replicated in 



Table I Direct modifiers of the CFTR interactome 



Locus 


Gene 


Function 


Phenotype 


Method 


Reference 


I2ql3.l3 


KRT8 


Cytoskeleton remodeling 


Disease severity 


CGA 


98 


I2ql3.l3 


KRTI8 


Cytoskeleton remodeling, delivery of mutated CFTR to the plasma 
and in IL-6-mediated barrier protection 


Disease severity 


CGA 


98 


I7q2l.2 


KRTI9 


Cytoskeleton remodeling, organization of myofibers 


Lung disease 


CGA 


80 


1 9q 1 3.4 1 


PPP2RIA 


Regulation of CFTR channel activity 


Lung disease 


CGA 


80 


9q34.l 1 


PPP2R4 


Regulatory subunitfor serine/threonine-protein phosphatase 2A 


Lung disease 


CGA 


80 


1 6p 1 2.2 


SCNNIB 


Electrodiffusion of the luminal sodium through the apical membrane 
of epithelial cells 


Lung disease 


CGA 


81 


1 6p 1 2.2 


SCNNIG 


Electrodiffusion of luminal sodium through the apical membrane 
of epithelial cells 


Lung disease 


CGA 


81 


5pl5.33 


SLC9A3 


Important role in signal transduction, sodium/proton exchanger 


Ml + lung 


GWAS/CGA 


58,76,76,99 


I5ql5.l 


SNAP23 


Important regulator of transport vesicle docking and fusion 


Lung disease 


CGA 


80 


7ql 1.23 


STXIA 


Docking of synaptic vesicles at presynaptic active zones 
Mediates Ca 2+ regulation of exocytosis acrosomal reaction in sperm 


Lung disease 


CGA 


79 



Abbreviations: CFTR, cystic fibrosis transmembrane conductance regulator; Ml, meconium ileus; CGA, candidate gene approach; GWAS, genome-wide association study; 
IL, interleukin. 
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Table 2 Indirect modifiers of different pathways 



Locus 


Gene 


Pathway/function 


Phenotype 


Method 


Reference 


I7q23.3 


ACE 


Metabolism, peptide hormone/electrolyte balance, vasopressor 


Liver disease 


CGA 


93 


I2p 13.33 


ADIP0R2 


Metabolism, lipids, and lipoproteins/adiponectin receptor 


Ml 


GWAS 


57 


6p2l.32 


ACER 


Immune system, innate/inflammatory response 


Lung disease 


CGA 


82 


1 1 p 1 3 


APIP 


Metabolism, amino acids/inhibitor of apoptosis 


Lung disease 


GWAS 


45,46,48,49 


5ql4.3 


ARRDC3 


Metabolism, lipids and lipoproteins/regulator of body mass and 


BMI 


GWAS 


60 






energy expenditure 








20q 1 3.2 


AURKA 


Mitotic cell cycle/cytokinesis and disassembly of primary cilia 


Lung disease 


GWAS 


46,51 


2q37.3 


CAPN10 


Signal transduction/calcium-regulated protease 


Diabetes 


CGA 


90 


20q 13.31 


CASS4 


Signal transduction/tyrosine kinase-based signaling 


Lung disease 


GWAS 


46,50 


20q 1 3.2 


CBLN4 


Cell-cell communication/synaptic function 


Lung disease 


GWAS 


46 


6p22.3 


CDKALI 


Gene expression/tRNA processing 


Diabetes 


GWAS 


61 


9p2l .3 


CDKN2B 


Mitotic cell cycle/signaling by TGF-p receptor complex 


Diabetes 


GWAS 


61 


Ip3l-p22 


CLCA4 


Transmembrane transport of small molecules/mediator of 


Intestine 


CGA 


88,89 






calcium-activated chloride conductance 








20q 1 3.2 


CSTF1 


Gene expression/processing of intron-less pre-mRNA 


Lung disease 


GWAS 




2q35 


CXCR1 


Signal transduction/chemokine receptor, activation of neutrophils 


Lung disease 


CGA 


83 


2q35 


CXCR2 


Signal transduction/chemokine receptor, activation of neutrophils 


Lung disease 


CGA 


83 


5q3l-q32 


DCTN4 


Immune system, adaptive/dynein targeting 


P. aeruginosa 


EWAS 


62 


4q3 1 .22 


EDNRA 


Signal transduction/endothelin receptor, smooth muscle 


Lung disease 


CGA 


75 






proliferation 








1 1 p 1 3 


EHF 


Metabolism of proteins/regulation of epithelial cell differentiation 


Lung disease 


GWAS 


45,46,47 






and proliferation 








1 1 p. 1 3 


ELF5 


Metabolism of proteins/regulation of keratinocyte differentiation, 


Lung disease 


GWAS 


45,46 






ectoderm development 








1 1 ql 3.2 


GSTPI 


Metabolism, biological oxidations/glutathione conjugation 


Liver disease 


CGA 


94 


6q23.3 


IFNGR1 


Immune system/interferon gamma receptor 


Disease severity 


CGA 


97 


7q3l.l 


IFRDI 


Developmental biology/regulator of neutrophil effector function 


Lung disease 


CGA 


46,72 


3q27.2 


IGF2BP2 


Gene expression /regulation of translation of insulin-like growth 


Diabetes 


GWAS 


61 






factor 2 mRNAs 








2ql4 


ILIB 


Immune system/cytokine signaling, inflammatory response 


Disease severity 


CGA 


97 


4ql3.3 


IL-8 


Signal transduction/chemokine signaling, airway inflammation 


Lung disease 


CGA 


72-74 


lOql 1.2 


MBL2 


Immune system, innate/activates the lectin complement pathway 


Lung + liver + 


CGA 


10,45,63,64 








Pseudomonas 






20q 1 3.2 


MC3R 


Signal transduction/energy balance and neutrophil accumulation 


Lung disease 


GWAS 


46,52,53 


8p23.l 


MSRA 


Cellular response to stress/repair of proteins inactivated by 


Ml 


GWAS 


56 






oxidation 








1 1 pi 5.5 


MUC5AC 


Metabolism of proteins/protection of the mucosa from infection 


Lung disease 


CGA 


84 






and chemical damage 








1 1 p 1 3 


PDHX 


Metabolism of citric acid cycle/regulation of the pyruvate 


Lung disease 


GWAS 


45,46 






dehydrogenase complex 








I4q32.l3 


SERPINAI 


Hemostasis/inhibitor of serine proteases 


Liver disease 


CGA 


92,96 


4ql3.3 


SLC4A4 


Transmembrane transport of small molecules/electrogenic 


Ml 


GWAS 


57 






sodium/bicarbonate cotransporter 








Xq23 


SLC6AI4 


Transmembrane transport of small molecules/amino acid 


Ml + lung + 


GWAS 


58,99 






transport across the plasma membrane, uptake of basic and 


Pseudomonas 










neutral amino acids 








Iq32.l 


SLC26A9 


Transmembrane transport of small molecules /anion exchanger 


Ml + diabetes 


GWAS/ 


58,61,99 






mediating chloride, sulfate and oxalate transport 




CGA 




I7q2l 


STAT3 


Immune system/mediates cellular responses to interleukins and 


Intestine 


CGA 


97 






other growth factors 








I0q25.3 


TCF7L2 


Signal transduction/maintenance of the epithelial stem cell 


Diabetes 


CGA 


91 






compartment of the small intestine 








1 9q 1 3.2 


TGFBI 


Signal transduction/regulation of airway inflammation, extracellular 


Lung disease 


CGA 


10,70,71,95 






matrix organization 








Iq4l 


TLR5 


Immune system, innate/cytokine secretion and the inflammatory 


Lung disease 


CGA 


87 






response 









Abbreviations: BMI, body mass 
GWAS, genome-wide association 



index; Ml, meconium ileus; P. aeruginosa, Pseudomonas aeruginosa; CGA, candidate gene approach; EWAS, exome-wide association study; 
study; TGF-p\ transforming growth factor-beta; mRNA, messenger RNA; tRNA, transfer RNA. 
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694 individuals. In addition, SNPs for type 2 diabetes at or 
near CDKAL1, CDKN2A/B, and IGF2BP2 were found to be 
associated with CF-related diabetes as well. 61 As SLC26A9 
encodes an anion transporter conducting both chloride and 
bicarbonate, the authors speculate that "alteration in the 
expression of CFTR or SLC26A9 (or both) might affect 
(3-cell function directly by altering chloride or bicarbonate 
flow (or both)". 

Candidate gene studies 

CF lung disease 

Defective ion transport resulting in disturbed mucociliary 
clearance and, as a consequence, in acute and chronic 
infections followed by an inflammatory response, form 
the basis of lung disease in CF. Therefore, genes involved 
in these pathways are expected to contribute to the CF 
phenotype by modifying the clinical course and severity of 
lung disease. 

One of the first and most extensively studied modifier 
genes is MBL2, the gene that encodes mannose binding 
lectin (MBL) and that was selected as a candidate based 
on its critical role in the innate immune response and on 
the fact that MBL deficiency leads to a predilection for 
bacterial and viral infection. However, the findings of these 
numerous studies are in part contradictory, and the role of 
MBL2 as a genetic modifier of CF remains a matter of some 
debate. Six studies involving more than 2,000 CF patients 
demonstrated an association of MBL-deficient genotypes 
with more severe lung function, three studies reported earlier 
onset of P. aeruginosa infection, whereas several studies did 
not detect an association with lung disease and/or with age 
of infection with P. aeruginosa. In the most comprehensive 
study of MBL2 as a CF modifier, Dorfman et al 62 evaluated 
the association of MBL2 genotypes with CF pulmonary 
phenotype in a cohort of 1 ,0 1 9 Canadian pediatric CF patients 
and demonstrated that MBL2 deficiency was significantly 
associated with an increased risk of early infection with 
P. aeruginosa and the rate of decline of pulmonary func- 
tion in pediatric CF patients. Using a modeling framework 
considering confounding effects of related variables, such 
as lung disease severity, infection status, age, and CFTR 
genotype, McDougal et al 63 showed that the MBL2 genotype 
was associated with infection status rather than with the 
other variables and that MBL deficiency primarily results in 
earlier P. aeruginosa infection and conversion to the mucoid 
form, leading presumably to more severe lung disease and 
reduced survival. Most of these studies have been reviewed 
and included in a meta-analysis by Chalmers et al, 64 who 



concluded that the MBL2 gene is a modifier of CF lung dis- 
ease and that MBL insufficiency is associated with earlier 
acquisition of P. aeruginosa, reduced lung function, and a 
higher rate of end-stage CF. 

The gene second most often investigated for modifier 
effects is transforming growth factor beta 1 (TGFj3l), which 
is well known to be involved in the regulation of airway 
inflammation and remodeling 63 - 66 and to influence the risk 
for airways disease in asthma 67 - 68 and chronic obstructive 
pulmonary disease. 69 The first large study reporting an 
association between TGF(3l variants in the promoter 
(-509OT) and exon 1 (codon 10, c.29C>T) and severity 
of CF lung disease in a cohort of 808 F508del homozygotes 
was published by Drumm et al. 70 This finding was replicated 
in 498 patients with various genotypes and independently 
confirmed by Bremer et al, 71 demonstrating association of 
a TGF/31 haplotype (3' C allele (rs8179181/-509C/c.29T) 
with improved lung function in CF. A few small studies 
reported no or not the same association between TGFfil and 
CF lung function. 10 

Gu et al 72 identified the interferon-related developmental 
regulator 1 (IFRD1) gene as a modifier of the severity of 
CF lung disease. Via transcriptional mechanisms, IFRD1 
regulates the neutrophil effector function during bacterial 
infection and may modulate the pathogenesis of CF lung 
disease through this pathway. Polymorphisms in IFRD1 
showed association in a family-based (twin and siblings) 
population, but did not achieve genome-wide significance 45 
and have not been replicated in other CF populations. 

Another gene, apart from MBL2, TGFB] and IFRD1, that 
is well known to be associated with airways inflammation 
in CF and suggested to modify lung disease severity by 
affecting host ability to tolerate infection, 72 is interleukin-8 
(IL-8). A modifier effect of variants in IL-8 was first noted 
in a CF cohort of 737 F508del homozygous patients and 
found to be most strongly pronounced in males. 73 A second 
study in another 385 patients was able to confirm this finding 
for males; 73 however, a further analysis of 329 patients failed to 
detect an association between lung disease and IL-8 variants. 74 

A well replicated association with CF lung disease has 
been described for the variant 6672G>C(rs5335) in the 
3' UTR of the endothelial receptor type A (EDNRA) gene. 
The C allele was more commonly found in CF patients with 
severe lung disease in four separate CF cohorts and correlated 
with increased EDNRA mRNA levels, resulting in increased 
smooth muscle proliferation and subsequent worsening of the 
CF pulmonary phenotype by altering smooth muscle tone in 
the airways and/or vasculature. 75 
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As CFTR dysfunction leads to salt and water imbalance 
across airway epithelia and impaired mucociliary clearance, 
variants in the SLC9A3 gene encoding the sodium/hydrogen 
exchanger 3 (NHE3) were tested and found to be associated 
with susceptibility to bacterial infections and severity of CF 
lung disease. 76 SLC9A3 reciprocally interacts with CFTR 
through a regulatory complex including NHERF-2, ezrin, 
and protein kinase A, and therefore provides plausible genetic 
evidence of a modulatory effect. 77 Additional genes of the 
CFTR interactome, such as SCNN1B and SCNN1 G encoding 
the P-subunits and y-subunits of ENaC, 78 STX1A, 19 KRT19, 
PPP2R4, PPP2R1A, and SNAP23, m have been found to be 
involved in the modification of CF lung disease. While the 
modulating effect of SCNN1B and SCNN1G could not be 
confirmed, 81 the influence of STX1A on lung disease severity 
was replicated in an independent CF patient cohort and 
variant rs2228607 was shown to reinforce aberrant splicing 
of STX1A mRNA, triggering nonsense-mediated mRNA 
decay and indicating functional relevance of this variant. 79 
The findings by Gisler et al 80 are based upon longitudinal 
data for six different lung function parameters and are 
waiting replication; however, to have comparable longitu- 
dinal data from an independent CF patient cohort at hand 
is not simple. 

Further associations with CF lung disease have been 
described, but not yet replicated, for the variant -429T/C 
in the promoter of the AGER gene, for SNPs in CXCR1 and 
CXCR2, and for a 6.4 kb VNTR allele of the secreted mucin 
gene MUC5AC. The AGER allele -429C has been shown to 
lead to an increased promoter activity upregulating RAGE, 
that triggers the generation of reactive oxygen species and 
activation of signal transduction pathways, and exces- 
sive expression of which may result in increased airway 
inflammation and enhanced lung disease severity. 82 The 
chemokine receptors CXCR1 and CXCR2 play an important 
role in neutrophilic inflammation, and a specific CXCR1/2 
haplotype cluster has been identified that is associated with 
lung function in CF patients and is thought to modulate the 
pulmonary outcome through a dysregulation of neutrophilic 
effector functions. 83 As the MUC5AC gene may be involved 
in mucociliary clearance, Guo et al 84 speculate that the 6.4 kb 
severity-associated VNTR allele could alter the function of 
the airway mucus, affecting the inflammation status and 
progression of lung disease. 

Clinical trials have demonstrated that anti-inflammatory 
therapy may improve lung function and body weight in CF 
patients, 85 and emerging evidence indicates that inhibition 
of the flagellin receptor TLR5 normalizes the inflammatory 



response generated by CF airway epithelial cells after 
exposure to P. aeruginosa.* 6 To validate TLR5 as a dis- 
ease modifier and anti-inflammatory target, Blohmke 
et al 87 examined the relationship between the 7ZJ?5SNP 
C.11740T (rs5744168) leading to apremature stop codon 
and the two clinical parameters of lung function and body 
weight in 2,219 CF patients. They were able to demonstrate 
that the TLR5 allele c.l 174T significantly decreases TLR5 
responsiveness and associates with improved nutritional 
status in adults with CF, providing evidence that therapeutic 
strategies to inhibit TLR5 may be beneficial for the clinical 
outcome of CF patients (Tables 1 and 2). 

CF in intestinal cells 

The predisposition to meconium ileus in CF is clearly 
genetically determined; however, the candidate gene 
approach has not been applied to this CF trait so far. 
The CLCA gene locus has previously been identified as 
a modulator of the basic gastrointestinal defect in CF. 88 
Using a customized SNP map with a total of 17 diallelic 
markers and a resequencing strategy for fine-mapping, 
Kolbe et al 89 detected significant differences in allele dis- 
tribution for markers within the promoter of the CLCA4 
gene among F508del homozygous patients with either no, 
CFTR-mediated, or Ca 2+ -mediated residual chloride con- 
ductance. These findings suggest that CLCA4 modulates the 
capability to express residual chloride secretion in colonic 
tissue and therefore acts as a modifier of the basic defect 
in intestinal cells of CF patients. 

CF-related diabetes 

The fact that diabetes is a common complication of CF and 
strongly influenced by modifier genes and that a family history 
of type 2 diabetes increases the risk for CF-related diabetes 
raises the need to test T2DM-associated genes as potential 
modifiers of CF-related diabetes. Analyzing six genes con- 
tributing to the manifestation of type 2 diabetes, Derbel et al 90 
presented the first evidence for an involvement of a polymor- 
phism in the CAPN10 gene in the pathogenesis of CF-related 
diabetes. A further candidate gene approach revealed the modi- 
fying effects of a variant in the TCF7L2 gene 91 that associates 
with both type 2 diabetes and CF-related diabetes, supporting 
the concept that diabetes develops in patients who may have 
underlying susceptibility to (3-cell dysfunction. 

CF-related liver disease 

Only a subset (3%-5%) of CF patients develops severe 
liver disease (CFLD), characterized by cirrhosis with portal 
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hypertension indicating contribution of non-CFTR genetic 
variability to the risk for CFLD. The largest two-stage study 
focused on nine functional variants in five candidate genes 
(SERPINA1, ACE, GSTP1, MBL2, TGFpl) previously stud- 
ied in CFLD, but including only small and heterogeneous 
patient cohorts with and without portal hypertension. 92 " 95 
Two large populations of CF patients with (128/971 
and 139/1,227, respectively) and without (843/971 and 
1 ,088/1 ,227, respectively) liver disease and portal hypertension 
were tested and only the SERPINA1 Z-allele was found to 
significantly associate with CFLD and portal hypertension. 96 
TGF/31 variants showed association with CFLD in stage 1 , but 
did not replicate in stage 2 of the study. 

Manifestation and seventy of CF disease 

An association study comparing F508del homozygous CF 
patient subsets selected for an extreme clinical phenotype 
and/or their basic defect manifestation tested the genes 
STAT3, IL1B, and IFNGR1 as modifiers of CF disease. 
Evidence for allelic association was found for STAT 3 and 
CFTR-mediated residual chloride secretion in intestinal 
tissue, for IL1B and disease severity, and for IFNGR1 and 
intrapair discordance, implying that immunorelevant path- 
ways and ion secretion merge at the level of epithelial cells 
to integrate the signaling of cytokines due to innate and 
acquired immune defense. 97 

Another candidate gene approach investigated the 
two neighboring cytokeratin genes KRT8 and KRT18 in 
F508del homozygous sibling pairs and unrelated index 
cases selected for their extreme clinical or electrophysi- 
ological phenotype characterized by intestinal current or 
nasal potential difference measurements. KRT8, but not 
KRT18, showed association with CF disease severity and 
CFTR-mediated residual chloride secretion. Absence of 
chloride secretion was determined by the recessive haplo- 
type 1 122 (rs 1907671, rs4300473, rs2035878, rs2035875), 
whereas the contrasting haplotype 2211 was dominant for 
the presence of CFTR-mediated residual chloride secretion 
and associated with a mild phenotype. The significance 
of these results for KRT8 is that modifiers of CF disease 
severity can be recognized through their association with 
the basic defect, which is less influenced by environmental 
factors. 98 

Li et al 99 tested the hypothesis that the same risk alleles 
for meconium ileus in SLC26A9, SLC9A3, and SLC6A14 
are pleiotropic for severity of lung disease, age at first 
P. aeruginosa infection, and early exocrine pancreatic 
disease. They found that SLC26A9 was pleiotropic for 



meconium ileus and pancreatic damage at rs75 12462, 
SLC9A3 for meconium ileus and lung disease at rs 1 7563 1 6 1 , 
and SLC6A14 for meconium ileus and both lung disease 
and age at first P. aeruginosa infection at rs3788766. 
These findings demonstrate that pleiotropy in a multiorgan 
disease has to be checked for and can be assessed in a single 
study sample. Evidence of pleiotropy for (modifier) genes 
may enable the development of complementary therapeu- 
tic strategies that will benefit multiple disease-associated 
tissues. 

Implications for other 
monogenic disorders 

The impact of disease-modifying genes is manifold because 
understanding the diverse genetic contribution to a disease 
phenotype has the potential to improve prognosis, to 
develop new therapeutic approaches, and to open the door 
to personalized medicine. The experience in CF modifier 
research represents a useful basis for other single gene 
disorders mainly regarding consistent phenotype measures 
and study designs. To identify modifying loci and genes and 
to replicate associations, both genome-wide methods and 
candidate gene approaches are needed including large sample 
sizes and phenotypically well defined homogeneous patient 
cohorts, respectively. Further, the role of genetic factors in 
the variability of a disease has to be evidenced by family 
studies. Genotyping of parents allows reliable assessment 
of descent and detection of transmission disequilibrium, 
whereas twin and sibling analyses are crucial to prove 
or exclude phenotypic correlation within families and to 
define the genetic model underlying variability in disease 
expression. 

It has to be considered that modifier genes may each con- 
tribute small effects to a disease phenotype, but may work in 
concert with environmental influences, resulting in an ampli- 
fied or diminished consequence. Thus, longitudinal studies 
of patients with monogenic disorders are needed to define 
key factors contributing to gene-environment interactions. 
Moreover, the admixture of ethnic groups plays an important 
role in the characterization of modifying effects, as modifier 
genes are diverse in different ethnicities and their complexity 
increases in ethnically admixed populations according to the 
extent of gene flow. 2 

The increase in modifier gene identification as a 
consequence of new genetic technologies leads to the 
conclusion that epistasis may represent a principal mecha- 
nism explaining the individual phenotypic variability of most 
genetic diseases. 
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Diagnostic and therapeutic 
significance 

Currently, it is very difficult to decide if a CFTR muta- 
tion is disease-causing or rather a neutral variant and if a 
specific gene is modifying the CF phenotype or not. Thus, 
correlation of genotype and basic defect phenotype (CFTR 
protein) remains an essential approach to directly measure 
the influence of CFTR mutations on subsequent clinical 
manifestations. However, identification of risk alleles in 
modifier genes and understanding their contribution to 
phenotypic variation may open the possibility of defining 
risk factors early in life, allowing more accurate diagnosis in 
due time and providing better prediction of disease course. 
With the advent of next-generation sequencing, large num- 
bers of genes and genetic variants can easily be detected, 
enabling individual genotyping of primary disease genes and 
modifier genes in one single step and within a short time. 
Assuming correct understanding and interpretation of these 
data, complex genetic testing including the entire panel of 
variants known to contribute to the phenotype of a specific 
disease will significantly improve diagnosis for any mono- 
genic disorder. Moreover, such a diagnostic approach may 
also facilitate personalized treatment according to a given 
patient's genetic profile. For example, CF patients with a 
modifier allele constellation predicting early P. aeruginosa 
infection and conversion to the mucoid form may benefit 
from earlier and more rigorous antibiotic applications than 
those commonly used. 

The identification of genes that influence clinical phe- 
notypes opens a new field of therapeutic approaches in that 
not only proteins of primary disease-causing genes but also 
of disease modifiers can be targeted to potentiate residual 
activities of the primary gene products or to fully compensate 
for their defective function. However, it has to be taken into 
account that effects of modifiers of CF and other monogenic 
disorders may be limited to an individual disease phase, 
age, sex, or a specific organ or tissue. Using CF again as 
an example, anti-inflammatory therapy may not have the 
expected effect in young patients due to the unwanted sup- 
pression of immune responses to sporadic infections, while 
the same therapy could improve the health of older patients 
with established chronic infections. 100 Therefore, strategies 
for the development of new drugs and therapies via modi- 
fiers should focus on specific, well defined patient groups 
(eg, specific age, disease severity, genotype) that are likely 
to benefit most from the therapeutic application. Moreover, 
the sequential application of different drugs according to, 
eg, age, disease stage, and/or a combination therapy (several 



drugs at the same time) may result in the highest response 
rate and best improvement in health. 

Conclusion 

Key reasons for identifying disease modifiers are to improve 
prognosis of disease course and outcome and to aid the devel- 
opment of therapies. Even though the primary defect in CF is 
monogenic by definition, the manifestation of endophenotypes 
such as diabetes or liver disease as well as disease severity 
are rather polygenic by nature, meaning that searching for 
modifiers in CF or any other monogenic disorder raises the 
same hurdles as characterization of other complex diseases, 
such as asthma or cardiovascular disorders. Nevertheless, the 
sum of modifier genes influencing the CF phenotype that has 
been reported so far is impressive and provides novel insight 
into biological pathways and disease pathophysiology, paving 
the way to the understanding of other genetic disorders. 
Exome-wide and genome-wide analyses will become routine 
in the near future and may improve diagnosis and phenotype 
prediction of genetic disorders by including all genes known 
to be involved in the manifestation of a specific disease. In 
addition, proteins encoded by genes that have been proven 
to modify the disease course will become promising targets 
for therapeutic interventions. 

In summary, modifier research opens new diagnostic, 
prognostic, and therapeutic perspectives, leading at the same 
time to the understanding that "there is no such thing as a 
"single" disorder" and that "genetic diseases represent a 
continuum with diminishing influence from a single primary 
gene influenced by modifier genes, to increasingly shared 
influence by multiple genes", 101 making genetics even more 
complex and complicated than before. 
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